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TFT-BOOTSTRAP: RESAMPLING TIME SERIES IN THE 
FREQUENCY DOMAIN TO OBTAIN REPLICATES 
IN THE TIME DOMAIN 1 
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Karlsruhe Institute of Technology (KIT) and 
University of California, San Diego 

A new time series bootstrap scheme, the time frequency tog- 
gle (TFT)-bootstrap, is proposed. Its basic idea is to bootstrap the 
Fourier coefficients of the observed time series, and then to back- 
transform them to obtain a bootstrap sample in the time domain. 
Related previous proposals, such as the "surrogate data" approach, 
resampled only the phase of the Fourier coefficients and thus had 
only limited validity. By contrast, we show that the appropriate re- 
sampling of phase and magnitude, in addition to some smoothing 
of Fourier coefficients, yields a bootstrap scheme that mimics the 
correct second-order moment structure for a large class of time se- 
ries processes. As a main result we obtain a functional limit theorem 
for the TFT-bootstrap under a variety of popular ways of frequency 
domain bootstrapping. Possible applications of the TFT-bootstrap 
naturally arise in change-point analysis and unit-root testing where 
statistics are frequently based on functionals of partial sums. Finally, 
a small simulation study explores the potential of the TFT-bootstrap 
for small samples showing that for the discussed tests in change-point 
analysis as well as unit-root testing, it yields better results than the 
corresponding asymptotic tests if measured by size and power. 

1. Introduction. Following Efron's seminal paper [17] on the i.i.d. boot- 
strap, researchers have been able to apply resampling ideas in a variety 
of non-i.i.d. situations including the interesting case of dependent data. 
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Biihlmann [6] , Lahiri [36] and Politis [50] give reviews of the state-of-the-art 
in resampling time series and dependent data. 

In the last two decades, in particular, resampling methods in the frequency 
domain have become increasingly popular (see Paparoditis [41] for a recent 
survey). One of the first papers to that effect was Franke and Hardle [18] 
who proposed a bootstrap method based on resampling the periodogram in 
order to devise confidence intervals for the spectral density. The idea be- 
hind that approach is that a random vector of the periodogram ordinates 
at finitely many frequencies is approximately independent and exponentially 
distributed (cf., e.g., Brockwell and Davis [5], Theorem 10.3.1). Later this ap- 
proach was also pursued for different set-ups, for example, for ratio statistics 
such as autocorrelations by Dahlhaus and Janas [15] or in regression models 
by Hidalgo [23]. Dahlhaus and Janas [27] suggested a modification of the pe- 
riodogram bootstrap which leads to a correct approximation for a wider class 
of statistics such as the sample auto-covariance which — in contrast to the 
sample autocorrelation — is not a ratio statistic. Kreiss and Paparoditis [34] 
propose the autoregressive-aided periodogram bootstrap where a parametric 
time domain bootstrap is combined with a nonparametric frequency domain 
bootstrap in order to widen the class of statistics for which the bootstrap is 
valid. 

We will refer to the above methods as periodogram bootstrapping as all 
of the statistics of interest there were functionals of the periodogram. Since 
these bootstrap methods resample the periodogram, they generally do not 
produce bootstrap pseudo-series in the time domain. A recent exception is 
a "hybrid" bootstrap of Jentsch and Kreiss [28], that is, an extension of the 
aforementioned method of Kreiss and Paparoditis [34]. 

We now wish to focus on two well-known proposals on frequency-domain 
bootstrap methods that also yield replicates in the time domain, notably: 

• The early preprint by Hurvich and Zeger [25] who proposed a parametric 
bootstrap very similar to our TFT wild bootstrap of Section 2, as well as 
a nonparametric frequency-domain bootstrap based on prewhitening via 
an estimate of the MA(oo) transfer function. Although never published, 
this paper has had substantial influence on time series literature as it 
helped inspire many of the above periodogram bootstrap methods. Note 
Hurvich and Zeger [25] provide some simulations but give no theoretical 
justification for their proposed procedures; indeed, the first theoretical 
justification for these ideas is given in the paper at hand as special cases 
of the TFT-bootstrap. 

• The "surrogate data" approach of Theiler et al. [57] has received signif- 
icant attention in the physics literature. The idea of the surrogate data 
method is to bootstrap the phase of the Fourier coefficients but keep their 
magnitude unchanged. While most of the literature focuses on heuristics 
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and applications, some mathematical proofs have been recently provided 
(see Braun and Kulperger [3], Chan [10], Mammen and Nandi [39], and 
the recent survey by Maiwald et al. [37]). The surrogate data method was 
developed for the specific purpose of testing the null hypothesis of time 
series linearity and is not applicable in more general settings. To see why, 
note that every surrogate sample has exactly the same periodogram (and 
mean) as the original sequence. Hence, the method fails to approximate 
the distribution of any statistic that is a function of first- and second- 
order moments, thus excluding all cases where periodogram resampling 
has proven to be useful; see our Proposition 2.1 in Section 2.1. 

In the paper at hand, we propose to resample the Fourier coefficients — 
which can effectively be computed using a fast Fourier transform (FFT) — in 
a variety of ways similar to modern periodogram bootstrap methods, and 
then obtain time series resamples using an inverse FFT. Since we start out 
with an observation sequence in the time domain, then jump to the frequency 
domain for resampling just to get back to the time domain again, we call 
this type of resampling a time frequency toggle (TFT) bootstrap. The TFT- 
bootstrap is an extension of existing periodogram bootstrap methods as 
it yields almost identical procedures when applied to statistics based on 
periodograms, but it is also applicable in situations where the statistics of 
interest are not expressible by periodograms; for more details we refer to 
Section 6. 

The TFT-bootstrap is related to the surrogate data approach but is more 
general since it also resamples the magnitudes of Fourier coefficients and not 
just their phases. As a result, the TFT is able to correctly capture the distri- 
bution of statistics that are based on the periodogram. The TFT, however, 
shares with the surrogate data approach the inability to approximate the 
distribution of the sample mean; luckily, there are plenty of methods in the 
bootstrap literature to accomplish that, for example, the block bootstrap 
and its variations, the AR-sieve bootstrap, etc. (for details, see Lahiri [36], 
Biihlmann [6], Politis [50]). 

In this paper we provide some general theory for the TFT-bootstrap which 
not only gives a long-due theoretical justification for one of the proposals by 
Hurvich and Zeger [25] but also allows for several modern extensions of these 
early ideas. In particular, we prove that the TFT sample has asymptotically 
the correct second-order moment structure (Lemma 3.1) and provide a func- 
tional central limit theorem (FCLT, Theorem 3.1 and Corollary 3.1) for the 
TFT-sample. This is a much stronger result than the asymptotic normality 
with correct covariance structure of a finite subset as proved, for example, 
by Braun and Kulperger [3] for the surrogate data method. 

As in the surrogate data method, the TFT sample paths are shown to 
be (asymptotically) Gaussian; so in a sense the TFT approximates possibly 
nonlinear time series with a Gaussian process having the correct second- 
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order moment structure. This seems to be inevitable in all methods us- 
ing discrete Fourier transforms due to the fact that Fourier coefficients are 
asymptotically normal under very general assumptions. However, in contrast 
to the surrogate data method, the TFT is able to capture the distribution 
of many useful statistics (cf. Section 6). For example, our FCLT implies the 
validity of inference for statistics such as CUSUM-type statistics in change- 
point analysis (cf. Section 6.2) or least-squares statistics in unit-root testing 
(cf. Section 6.3). The TFT-bootstrap is also valid for periodogram-based (ra- 
tio) statistics such as sample autocorrelations or Yule-Walker estimators; 
this validity is inherited by the corresponding results of the periodogram 
bootstrapping employed for the TFT (cf. Section 6.1). 

Furthermore, in many practical situations one does not directly observe 
a stationary sequence but needs to estimate it first. In Corollary 4.1 we 
prove the validity of the TFT-bootstrap when applied to such estimated 
sequences. For example, in change-point analysis (Section 6.2) as well as 
unit-root testing (Section 6.3) one can use estimators to obtain an approx- 
imation of the underlying stationary sequence under the null hypothesis as 
well as under the alternative. As in both examples the null hypothesis is 
that of a stationary sequence; this feature enables us to construct bootstrap 
tests that capture the null distribution of the statistic in question even when 
presented with data that obey the alternative hypothesis. As a consequence, 
these bootstrap tests asymptotically capture the correct critical value even 
under the alternative hypothesis which not only leads to the correct size of 
the tests but also to a good power behavior. 

The remainder of the paper is organized as follows. In the next section 
we give a detailed description on how the TFT-bootstrap works. In partic- 
ular we describe several specific possibilities of how to get pseudo-Fourier 
coefficients. In Section 3 we state the main theorem, a functional limit the- 
orem for the TFT-bootstrap. The FCLT holds true under certain high-level 
assumptions on the bootstrapped Fourier coefficients; these are further ex- 
plored in Sections 4 and 5. In particular it is shown that the TFT-bootstrap 
replicates the correct second-order moment structure for a large class of 
observed processes including nonlinear processes (cf. Section 5). Finally, we 
prove the validity of the TFT-bootstrap for certain applications such as unit- 
root testing or change-point tests in Section 6 and explore the small sample 
performance in the simulation study of Section 7. Our conclusions are sum- 
marized in Section 8. Proofs are sketched in Section 9, while the complete 
technical proofs can be found in electronic supplementary material. 

2. Description of the TFT-bootstrap. Assume we have observed V(l), . . . , 
V(T), where 

Assumption V.l. {V(i) :i > 1} is a stationary process with absolutely 
summable auto-covariance function 7(-). In this case the spectral density of 
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the process exists, is continuous and bounded. It is defined by 

1 oo 

(2-1) f ^ = ^ E e" mA 7(n) 

n=— oo 

(see, e.g., Brockwell and Davis [5], Corollary 4.3.2). 



Since we will prove a functional central limit theorem for the bootstrap 
sequence, the procedure only makes sense if the original process fulfills the 
same limit theorem. 

Assumption V. 2. {V(i) :i > 1} fulfills the following functional central 
limit theorem: 

f 1 [Tul 1 

< - V (V(t) - EV(1)) : < u < 1 } -A {W(u) : < u < 1}, 

where /(•) is the spectral density of {V(-)} and {W(-)} is a standard Wiener 
process. 

We may need the following assumption on the spectral density: 

Assumption V. 3. Let the spectral density be bounded from below 
/(A)>c>0 for all < A < 2vr. 

We denote by Z(j) = V(j) — Vt the centered counterpart of the observa- 
tions where Vt = T -1 ^2+ V(t). Consider the FFT coefficients of the observed 
stretch, that is, 

1 T 

aft) = cosC-Ajt), 
^ T t=i 

1 T 

yU) = -7=^2v(t)sn(-X j t), 

t=l 

thus 

1 T 

x(j)+iy(j) = -^y2 v (t) exp(-iXjt), 
^ T t=i 

where Xj = 2nj/T for j = 1, . . . ,T. Note that the Fourier coefficients x(j), 
y(j) depend on T, but to keep the notation simple we suppress this depen- 
dence. 

The principal idea behind all bootstrap methods in the frequency domain 
is to make use of the fact that the Fourier coefficients x(l),y(l), . . . ,x(N), 
y(N) are asymptotically independent and normally distributed, where N = 
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L(T — l)/2j denotes the largest integer smaller or equal to (T — l)/2, and 
Ex(j) -> 0, Ey(j) -»• and 

(2.2) 

varx(j) = 7t/ (Aj) + o(l), vary(j) = nf(Xj) + o(l) 

for j = 1, . . . , N as T — > oo where /(•) is the spectral density (see, e.g., Chap- 
ter 4 of Brillinger [4] for a precise formulation of this vague statement). 
Lahiri [35] gives necessary as well as sufficient conditions for the asymptotic 
independence and normality of tapered as well as nontapered Fourier coef- 
ficients for a much larger class of time series not limited to linear processes 
in the strict sense. Shao and Wu [55] prove this statement uniformly over all 
finite subsets for a large class of linear as well as nonlinear processes with 
nonvanishing spectral density. The uniformity of their result is very help- 
ful since it implies convergence of the corresponding empirical distribution 
function (see the proof of Lemma 5.3 below). 

The better known result on the periodogram ordinates states that x(j) 2 + 
y(j) 2 are asymptotic independent exponentially distributed with expectation 
27r/(Aj) (see, e.g., Chapter 10 of Brockwell and Davis [5]). The latter is 
what most bootstrap versions are based on. By contrast, our TFT-bootstrap 
will focus on the former property (2.2), that is, the fact that x(j),y(j) are 
asymptotically i.i.d. N(0, nf(Xj)). 

Let us now recall some structural properties of the Fourier coefficients, 
which are important in order to understand the procedure below. First note 
that 

(2.3) x(T-j) = x(j) and y(T - j) = -y(j). 

This symmetry implies that the Fourier coefficients for j = 1, . . . , N carry all 
the necessary information required in order to recapture (by inverse FFT) 
the original series up to an additive constant. 

The symmetry relation (2.3) shows in particular that all the information 
carried in the coefficients for Xj, j > T/2, is already contained in the coeffi- 
cients for A,-, j < N. The information that is missing is the information about 
the mean of the time series which is carried by the remaining coefficients 
belonging to At and X T / 2 (the latter only when T is even). 

To elaborate, 

x(T) = VTV T 

carries the information about the mean of the observations V(l), . . . , V(T); 
moreover, 

y(T) = 0. 

For T even, we further have some additional information about the "alter- 
nating" mean 

1 T 

y(T/2) = and x(T/2) = -= £(-l)V(t). 

t=l 
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Note that the value of the FFT coefficients for j = 1, . . . , N is the same 
for a sequence V(t) — c for all c £ K. Hence those Fourier coefficients are 
invariant under additive constants and thus contain no information about 
the mean. Similarly all the information about the bootstrap mean is carried 
only in the bootstrap version of x(T) [as well as x(T/2)\. The problem of 
bootstrapping the mean is therefore separated from getting a time series 
with the appropriate covariance structure and will not be considered here. 
In fact, we show that any asymptotically correct bootstrap of the mean if 
added to our bootstrap time series [cf. (2.5)] yields the same asymptotic 
behavior in terms of its partial sums as the original uncentered time series. 
Our procedure works as follows: 

Step 1: Calculate the Fourier coefficients using the fast Fourier transform 
(FFT) algorithm. 

Step 2: Let y*(T) = x*(T) = 0; if T is even, additionally let x*(T/2) = 
y*(T/2) = 0. 

Step 3: Obtain a bootstrap sequence x*(l),y*(l), ... ,x*(N),y*(N) using, 
for example, one of bootstrap procedures described below. 

Step 4: Set the remaining bootstrap Fourier coefficients according to (2.3), 
that is, x*(T - j) = x*(j) and y*(T - j) = -y*{j). 

Step 5: Use the inverse FFT algorithm to transform the bootstrap Fourier 
coefficients x*(j) + iy*(j), j = 1, . . . ,T, back into the time domain. 

We thus obtain a bootstrap sequence {Z*(t) : 1 < t < T} which is real- valued 
and centered, and can be used for inference on a large class of statistics that 
are based on partial sums of the centered process {Z(-)}; see Section 6.2 for 
examples. 

Remark 2.1. Note that the exact form of Z*{t) is the following: 
1 T 

Z*(t) = — J>*(j) + iy*(j)) exp(2nitj/T) 
V i=i 

(2 - 4) 

= -=y>*(j)cos(27rt7yT) - y*(j)sin(27rfj/T)). 

Remark 2.2. In order to obtain a bootstrap sequence of the noncen- 
tered observation process {V(-)} we can add a bootstrap mean n* T to the 
{Z*(-)} process; here, /i^ is obtained by a separate bootstrap process in- 
dependently from {Z*(-)}, which is asymptotically normal with the correct 
variance, that is, it fulfills (3.1). 

Precisely, the bootstrap sequence 



(2.5) 



V*(t) = Z*(t) + fi* T 
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gives a bootstrap approximation of {V(-)}. Here, {Z*(-)} contains the in- 
formation about the covariance structure of the time series, and fj,* contains 
the information of the sample mean as a random variable of the time series. 
How to obtain the latter will not be considered in this paper. 

In Corollary 4.1 we give some conditions under which the above proce- 
dure remains asymptotically valid if instead of the process V(-) we use an 
estimated process V(-); this is important in some applications. 

Now we are ready to state some popular bootstrap algorithms in the fre- 
quency domain. We have adapted them in order to bootstrap the Fourier 
coefficients rather than the periodograms. Our procedure can easily be ex- 
tended to different approaches. 

Residual-based bootstrap (RB). 

Step 1: First estimate the spectral density /(•) by /(•) satisfying 
(2.6) sup |/(A)-/(A)| AO. 

Ae[0,7r] 

This will be denoted Assumption A.l in Section 5. Robinson [53] proves 
such a result for certain kernel estimates of the spectral density based on 
periodograms for a large class of processes including but not limited to linear 
processes. For linear processes he also proves the consistency of the spec- 
tral density estimate as given above when an automatic bandwidth selection 
procedure is used. Shao and Wu [55] also prove this result for certain kernel 
estimates of the spectral density for processes satisfying some geometric- 
moment contraction condition, which includes a large class of nonlinear pro- 
cesses. Both results are summarized in Lemma 5.1. 

Step 2: Next estimate the residuals of the real, as well as imaginary, part of 
the Fourier coefficients and put them together into a vector {sj : 1 < j < 2N}; 
precisely let 

~ _ <j) _ v(J) 

j = 1, . . . , N . Then standardize them, that is, let 

= 5-i/TOS^i5j 

VV(2A0 ES (Si - V(2A0 E?=i ^) 2 ' 

Heuristically these residuals are approximately i.i.d., so that i.i.d. resampling 
methods are reasonable. 

Step 3: Let s*, j = 1,...,2N, denote an i.i.d. sample drawn randomly 
and with replacement from s\, . . . ,S2N- As usual, the resampling step is 
performed conditionally on the data V(l), . . . , V(T). 
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Step 4: Define the bootstrapped Fourier coefficients by 

(2.7) x*(j) = yjirf(\ j )s* j , y*(j) = yf^)s% +j . 

An analogous approach — albeit focusing on the periodogram ordinates in- 
stead of the FFT coefficients — was proposed by Franke and Hardle [18] in 
order to yield a bootstrap distribution of kernel spectral density estimators. 

Wild bootstrap (WB). The wild bootstrap also makes use of an esti- 
mated spectral density further exploiting the knowledge about the asymp- 
totic normal distribution of the Fourier coefficients. Precisely, the WB re- 
places Sj above by independent standard normal distributed random vari- 
ables {Gj : 1 < j < 2N} in order to obtain the bootstrap Fourier coefficients 
as in (2.7). This bootstrap was already suggested by Hurvich and Zeger [25], 
who considered it in a simulation study, but did not obtain any theoretical 
results. 

An analogous approach — albeit focusing on the periodogram — was dis- 
cussed by Franke and Hardle [18] who proposed multiplying the periodogram 
with i.i.d. exponential random variables. 

Local bootstrap (LB). The advantage of the local bootstrap is that it does 
not need an initial estimation of the spectral density. The idea is that in a 
neighborhood of each frequency the distribution of the different coefficients 
is almost identical (if the spectral density is smooth). It might therefore 
be better able to preserve some information beyond the spectral density 
that is contained in the Fourier coefficients. An analogous procedure for 
periodogram ordinates was first proposed by Paparoditis and Politis [42]. 
For the sake of simplicity we will only consider bootstrap schemes that are 
related to kernels. 

Recall that x(-j) = x(j), x(\T/2~\ + j) = x(N - j) and y(-j) = -y(j), 
y(\T/2] + j) = y(N -j) for j = 1, . . . ,N + 1, and, for T even, x(N + 1) = 
y(N + 1) = ^= Ylt=i(~ (M denotes the smallest integer larger or 

equal than x). Furthermore let x(0) = y(Q) = be the Fourier coefficients 
of the centered sequence. For j < —T/2 and j > T/2 the coefficients are 
periodically extended with period T. 

Step 1: Select a symmetric, nonnegative kernel K (•) with 

j K(t)dt = l. 

In Section 5 we assume some additional regularity conditions on the kernel 
in order to get the desired results. Moreover select a bandwidth hr fulfilling 
Ht — > but TfiT — > oo. 
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Step 2: Define i.i.d. random variables Ji Xj • • • > J2N,t on Z with 
(2.8) ^_ J »(^ r _. ) _^_____. 

Independent of them define i.i.d. Bernoulli r.v. B\, . . . , I?2jV with parameter 
1/2. 

Step 3: Consider now the following bootstrap sample: 

'x(j + J jtT ), ifB; = 0, 



and 



y(j + J jt T), ifB i = l, 



y(j + Jn+j,t), if ^TV+j = o, 
xU + Jn+j,t), if -Bjv+j = 1. 



Finally the bootstrap Fourier coefficients are defined as the centered versions 
of x* , respectively, y* , namely by 

= - - ^ Ps T (x(j + s) + y(j + s)), 

y*(i) =y*(i) - ^^2p s ,t( x U + s ) + vU + s ))- 

This is slightly different from Paparoditis and Politis [42] , since they require 
that x*(j) and y*(j) share the same Jj t T which is reasonable if one is inter- 
ested in bootstrapping the periodogram but not necessary for bootstrapping 
the Fourier coefficients. 

Comparison of the three methods. The aforementioned three bootstrap 
methods, residual bootstrap (RB), wild bootstrap (WB) and local bootstrap 
(LB), are all first-order consistent under standard conditions. A rigorous the- 
oretical comparison would entail higher-order considerations which are not 
available in the literature and are beyond the scope of this work. Intuitively, 
one would expect the RB and LB procedures to perform similarly in appli- 
cations since these two bootstap methods share a common underlying idea, 
that is, that nearby periodogram/FFT ordinates are i.i.d. By contrast, the 
WB involves the generation of extraneous Gaussian random variables thus 
forcing the time-domain bootstrap sample paths to be Gaussian. For this 
reason alone, it is expected that if a higher-order property holds true in 
our setting, it will likely be shared by RB and LB but not WB. Our finite- 
sample simulations in Section 7 may hopefully shed some additional light 
on the comparison between RB and LB. 

2.1. Comparison with other frequency domain methods. First, note that 
the TFT wild bootstrap is identical to the parametric frequency-domain 
bootstrap proposal of Hurvich and Zeger [25]. By contrast, the nonparamet- 
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ric bootstrap proposal of Hurvich and Zeger [25] was based on prewhitening 
via an estimate of the MA(oo) transfer function. Estimating the transfer 
function presents an undesirable complication since prewhitening can be 
done in an easier fashion using any consistent estimator of the spectral den- 
sity; the residual-based TFT exploits this idea based on the work of Franke 
and Hardle [18]. The local bootstrap TFT is a more modern extension of the 
same underlying principle, that is, exploiting the approximate independence 
(but not i.i.d.-ness) of periodogram ordinates. 

We now attempt to shed some light on the relation between the TFT and 
the surrogate data method of Theiler et al. [57]. Recall that the surrogate 
data approach amounts to using 

(2.9) y /j(j)co6(2irU j ) + i^/ljj) sin(27r^) 

as bootstrap Fourier coefficients at point j where = x 2 (j)+y 2 (j) is 
the periodogram at point j, and {Uj :j} are i.i.d. uniform on [0,1]. So the 
periodogram (and mean) computed from surrogate is identical to the original 
ones. As a result we have the following proposition. 

Proposition 2.1. The surrogate data method fails to approximate the 
distribution of any statistic that can be written as a function of the peri- 
odogram (and/or sample mean) as long as this statistic has nonzero large- 
sample variance. 

The proof of this proposition is obvious since by replicating the peri- 
odogram (and sample mean) exactly, the surrogate approach will by neces- 
sity approximate the distribution of the statistics in question by a point 
mass, that is, zero variance. Hence, the surrogate data method will not 
be able to correctly capture distributional properties for a large class of 
statistics that are based entirely on periodograms and the mean (cf. also 
Chan [10]). In hypothesis testing, this failure would result in having power 
equal to the size when the surrogate approach is applied to obtain critical 
values for test statistics based entirely on periodogram and/or mean; at the 
very least, some loss of power in other situations can be expected. 

The aforementioned TFT methods obviously do not have this disadvan- 
tage; in fact, they can be successfully applied to this precise class of statistics 
(cf. Section 6.1). 

For comparison purposes, we now describe a nonsmoothed wild bootstrap 
that is the closest relative of the surrogate data method that fits in our 
framework. Note that all TFT-bootstrap schemes involve smoothing in the 
frequency domain before resampling (cf. also Assumption B.2); however, 
one could consider bootstrapping without this smoothing step. As the wild 
bootstrap works by multiplying normal random variables with an estimator 
of the spectral density, that is, a smoothed version of the periodogram, the 
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nonsmoothed wild bootstrap multiplies normal random variables with the 
original Fourier coefficients x(j),y(j). By the Box-Muller transform, the 
nonsmoothed wild bootstrap gives the following as the bootstrap (complex- 
valued) Fourier coefficient at point j: 

(2.10) x(j) V /-21og(tT i ) cos(2vr[/ i ) + iy(j)y/ -21og(E7 j ) sin(27r^), 

where {Uj : j} , {Uj : j} are i.i.d. uniform on [0,1] independent from each 
other. 

Comparing equation (2.10) to equation (2.9) we see that the surrogate 
data approach is closely related to the nonsmoothed wild bootstrap; the 
main difference is that the wild bootstrap does not only bootstrap the phase 
but also the magnitude of the Fourier coefficients. Nevertheless, the nons- 
moothed wild bootstrap does not suffer from the severe deficiency outlined 
in Proposition 2.1 since it does manage to capture the variability of the 
periodogram to some extent. To elaborate, note that it is possible to prove 
a functional limit theorem [like our Theorem 3.1(a) in the next section] for 
the nonsmoothed wild bootstrap but only under the provision that a smaller 
resample size is employed; that is, only a fraction of the bootstrap sample is 
used to construct the partial sum process (m/T — > 0). This undersampling 
condition is necessary here since without it the asymptotic covariance struc- 
ture would not be correct. Hence, even the nonsmoothed wild bootstrap, 
although crude, seems (a) preferable to the surrogate data method and (b) 
inferior with respect to the TFT-bootstrap; this relative performance com- 
parison is clearly born out in simulations that are not reported here due to 
lack of space. 

3. Functional limit theorem for the bootstrap sample. In this section we 
state the main result, namely a functional limit theorem for the partial sum 
processes of the bootstrap sample. 

The theorem is formulated in a general way under some meta-assumptions 
on the resampling scheme in the frequency domain that ensure the functional 
limit theorem back in the time domain. In Section 4 we verify those con- 
ditions for the bootstrap schemes given in the previous section. We would 
like to point out that the meta-assumptions we give are the analogues of 
what is usually proved for the corresponding resampling schemes of the pe- 
riodograms, which are known to hold for a large class of processes. 

The usage of meta-assumptions allows the reader to extend results to 
different bootstrap schemes in the frequency domain. 

By E*, var*, cov* and P* we denote as usual the bootstrap expectation, 
variance, covariance and probability. We essentially investigate three sets of 
assumptions. 

The first one is already implied by the above mentioned bootstrap schemes. 
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Assumption 23.1. For the bootstrap scheme in the frequency domain, 
the coefficients {x* (k) : 1 < k < N} and {y* (k) : 1 < k < N} are independent 
sequences as well as mutually independent (conditionally on the data) with 

E*(x*(k)) =E*(y*(k)) =0. 

Remark 3.1. Instead of assuming that the bootstrap samples are al- 
ready centered it is sufficient that the bootstrap means in the frequency 
domain converge uniformly to with a certain rate, that is, 

sup (\E*(x*(k))\ + |E*( y *(£0)|) = o P ( /— ™ ) , 

where m is the parameter figuring in Lemma 3.1 (resp., Theorem 3.1 below). 

Assumption B.I. Uniform convergence of the second moments of the 
bootstrap sequence in the frequency domain, that is, 

sup \var*(x*(k)) - irf(\ k )\ = o P (l), 

l<k<N 

sup |var*(y*(fc)) - 7r/(A fc )| = o P (l). 

l<k<N 

Assumption B.3. Uniform boundedness of the fourth moments of the 
bootstrap sequence in the frequency domain 

sup E*(x*(k)) A <C + o P {l), sup E*(y*(k)) A <C + o P (l). 

l<k<N l<k<N 

Let us now recall the definition of the Mallows distance on the space of 
all real Borel probability measures with finite variance. It is defined as 

d 2 (Pi,P2) = mi(E\X 1 -X 2 \ 2 ) 1 ^, 

where the infimum is taken over all real- valued variables {Xi^X^) with 
marginal distributions Pi and P 2 , respectively. Mallows [38] has proved the 
equivalence of convergence in this metric with distributional convergence in 
addition to convergence of the second moments. The results remain true if 
we have convergence in a uniform way as in Assumption BA below. This 
shows that Assumption BA implies Assumption B.I. 

Assumption BA. Let the bootstrap scheme in the frequency domain 
converge uniformly in the Mallows distance to the same limit as the Fourier 
coefficients do 

sup d 2 (C*(x*(j)),N(0,7Tf(\ j ))) = O P (l), 

sup d 2 (C*(y*(j)),N(0,7rf(X j ))) = o P (l). 

l<j<N 
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We will start with some results concerning the asymptotic covariance 
structure of the partial sum process; all asymptotic results are taken as 
T^oo. 

Lemma 3.1. 

(a) Let Assumption B.l be fulfilled. Then, for any < u < 1 and 1 < m <T , 



/ -, \ mu \ \ 
E* (-= VZ*(() 



0. 



(b) Let Assumptions V .1, B.l and B.2 be fulfilled. Then, for < u, v < 1, 

|_muj ^ L m,J J 

cov" 



V m ,i V m ,1 

v '1 = 1 t2 = l 



J 27r/(0)min(ii,u), - -> 0, 



27r/(0)[min(u, v) — uv], m = T. 
(c) Moreover, under Assumptions V.l, B.l and B.2, 
cov*(Z*{h), Z*{1 2 )) = cov(V(h), V{h)) + o P (l) = cov(Z(h), Z{1 2 )) + o P (l) 
for all fixed h,l2- 

As already pointed, out using frequency domain methods separates the 
problem of an appropriate bootstrap mean from the problem of obtaining 
a bootstrap sample with the appropriate covariance structure. As a result, 
the bootstrap sample {Z*(-)} is centered and thus the bootstrap version of 
the centered time series {•£(•)} = {V(-) — Vr}- 

The above lemma shows that the bootstrap process {Z*(-)} as well as its 
partial sum process has the correct auto-covariance structure. The following 
theorem gives a functional central limit theorem in the bootstrap world, 
showing that the bootstrap partial sum process also has the correct second- 
order moment structure. In fact, the partial sum process of a centered time 
scries converges to a Brownian bridge, while the subsampled partial sum 
processes converges to a Wiener process. As the following theorem shows 
this behavior is exactly mimicked by our TFT-bootstrap sample. 

Theorem 3.1. Let Assumptions V.l, B.1-B.3 be fulfilled. 
(a) If m/T — >0, then it holds (in probability) 

{ [mu\ \ 
_= X)Z*(Q:0<u<iy(0 \ D -H ] {W(u):0< U <l}, 

where {W{u) : < u < 1} is a Wiener process. 
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(b) If additionally Assumption B.4 is fulfilled, we obtain (in probability) 



where {B(u) : < u < 1} is a Brownian bridge. 

Remark 3.2. The stronger Assumption BA is needed only to get asymp- 
totic normality of the partial sum process, that is, part (b) above. In the 
proof of Theorem 3.1 for m/T — )■ we use the Lindeberg condition to obtain 
asymptotic normality. However, for m = T the latter is not fulfilled because 
the variance of single summands (e.g., l = T) is not negligible anymore. But 
for the same reason the Feller condition is also not fulfilled which means 
that we cannot conclude that the sequence is not asymptotically normal. In 
fact, failure of asymptotic normality is hard to imagine in view of Corol- 
lary 3.2. Therefore we recommend to always use m = T in applications even 
in situations where Assumption BA is hard to verify. 

Remark 3.3. The bootstrap variance is usually related to the peri- 
odogram which is not consistent without smoothing. Therefore Assump- 
tion B.2 ensures that the bootstrap scheme includes some smoothing. For 
the bootstrap, however, this is not entirely necessary, and we can also boot- 
strap without smoothing first. The simplest example is the nonsmoothed 
wild bootstrap as described in Section 2.1. One can then still prove the re- 
sult of Theorem 3.1, but only for m/T — > 0. In this situation this condition is 
necessary, since without it the asymptotic covariance structure is not correct 
(i.e., the assertion of Lemma 3.1 is only true for m/T — >■ 0), m ~ yT would 
be a good rule of thumb. While this is a very simple approach (without any 
additional parameters) it does not give as good results as the procedure we 
propose. Heuristically, this still works because the back-transformation does 
the smoothing, but to obtain a consistent bootstrap procedure we either 
need some smoothing in the frequency domain as in Assumption B.2 or do 
some under-sampling back in the time domain, that is, m/T — > 0. In fact, 
one of the main differences between our wild TFT-bootstrap and the surro- 
gate data approach is the fact that the latter does not involve any smoothing 
in the frequency domain. The other difference being that the surrogate data 
approach only resamples the phase but not the magnitude of the Fourier 
coefficients. For more details we refer to Section 2.1. 

Some applications are based on partial sums rather than centered par- 
tial sums. This can be obtained as described in Remark 2.2. The following 
corollary then is an immediate consequence of Theorem 3.1(b). 
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Corollary 3.1. Let the assumptions of Theorem 3.1(b) be fulfilled. Let 
fX^P be a bootstrap version of the mean /z = EV(l) [taken independently from 
{Z*(-)}J such that for all z£R, 

(3.1) p*(Vr04 -/*)<«) A$ 



v^F/(o)/ 



where <&(■) denotes the standard normal distribution function, so that the 
asymptotic distribution is normal with mean and variance 2-7r/(0). TTien 
it ZioWs (m probability) 

< V(V*(t)-/x):0<u<l W-) I °H ] {W(u):0<u< 1}, 

w/iere V*(i) = Z*(t) + n* T . 

Along the same lines of the proof we also obtain the analogue of the finite- 
sample result of Braun and Kulperger [3] for the surrogate data method. 
This shows that any finite sample has the same covariance structure as the 
corresponding finite sample of the original sequence; however, it also shows 
that not only do the partial sums of the bootstrap sample become more and 
more Gaussian, but each individual bootstrap observation does as well. 



Corollary 3.2. If Assumptions V.l, B.1-B.3 are fulfilled, then for 
any subset li,...,l p of fixed positive integers it holds (in probability) 

(Z*(h),...,Z*(l p )\V(-))-^N(0,X), 
where E = (o^Oi^i,...^ with a itj = cav(V(k), V(lj)). 

4. Validity of the meta-assumptions on the bootstrap Fourier coefficients. 

In this section we prove the validity of the bootstrap schemes if the Fourier 
coefficients satisfy certain properties. These or related properties have been 
investigated by many researchers in the last decades and hold true for a 
large variety of processes. Some of these results are given in Section 5. 

Recall Assumption A..1, which is important for the residual-based boot- 
strap (RB) as well as the wild bootstrap (WB). 

Assumption A.l. Let /(■) estimate the spectral density /(•) in a uni- 
form way, that is, fulfilling (2.6), sup Ag[0j7r] |/(A) - /(A)| — > 0. 

The next two assumptions are necessary to obtain the validity of the 
residual-based bootstrap. 
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Assumption A. 2. The following assertions on sums of the periodogram 
and/or Fourier coefficients hold: 



/v 



N 



i N 



(") 



i m p 



<C + o P (l) 



for some constant C > 0, where I (J) = x 2 (j) + y 2 (j) is the periodogram. 



In particular, (ii) is fulfilled if X)j=i 



1 v^/V I 2 U) p , 



7TO 



C>0. 



Assumption A. 3. The empirical distribution function based on the 
Fourier coefficients converges uniformly to the standard normal distribution 
function $(■), that is, 

JV 



sup 



l ^ 



0. 



The following two assumptions are necessary to obtain the validity of the 
local bootstrap (LB). 

Assumption AA. The following assertions on sums of the periodogram 
and/or Fourier coefficients hold true: 



(i) sup 

Kk<N 



(ii) sup 

Kk<N 



^2 Pj,T(x(k+j)+y(k+j)) 

=— oo 

oo 

p j , T I(k + j)-2xf(X k 



j=-oo 
oo 



= o P (l), 



iii) sup Pj,Tl 2 (k + j) <C + o P (l), 



Kk<N „ 

where J(j) = x 2 (j) + y 2 (j) if j is not a multiple of T and I(cT) = for cgZ 
and p^T are as in (2.8). 

Assumption .4.5. The empirical distribution function based on the 
Fourier coefficients converges uniformly to the standard normal distribution 
function $(•)> that is, 

JV 



sup P sup 



3=1 



{xO)<v^/(^)} 
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->o, 

where pj ; T are as in (2.8). 

The next theorem shows that the bootstrap methods RB, WB and LB ful- 
fill the Assumptions B.l-BA which are needed to obtain the functional limit 
theorem for the partial sums (cf. Theorem 3.1) under the above assumptions 
on the periodograms. 

Theorem 4.1. Let Assumption V .1 be fulfilled. 

(a) All three bootstrap methods RB, WB and LB fulfill Assumption B.l by 
definition. 

(b) Residual-based bootstrap RB: Let Assumption V .3 hold: 

(i) Under Assumption A.t RB fulfills Assumption B.2. 

(ii) If additionally Assumption A. 2 holds, RB fulfills Assumption B.3. 

(iii) If additionally Assumption A. 3 holds, then Assumption B.4 holds. 

(c) Wild bootstrap WB: under Assumption A.l WB fulfills Assumptions 
B.2-B.I 

(d) Local bootstrap LB: 

(i) Under Assumptions A. 40) <ind «4.^(ii) LB fulfills Assumption B.2. 

(ii) If additionally Assumption «4..^(iii) holds, then LB fulfills Assump- 
tion B.3. 

(iii) If additionally Assumption A. 5 holds, then LB fulfills Assump- 
tion B.4. 

In many applications we apply bootstrap methods not directly to a sta- 
tionary sequence {V{t) :t > 1} but rather to an estimate {V{t) : t > 1} thereof. 
The next corollary gives conditions under which the bootstrap schemes re- 
main valid in this situation. 

In order to give some general conditions under which the procedure re- 
mains valid we need to specify the type of spectral density estimator we 
want to use for the residual-based as well as the wild bootstrap. We want 
to use the following kernel density estimator (see also Lemma 5.1): 

T. 1& K{{\-\ j )/h T )i{j) 



(4.1) /t(A) 



where l(j) = x 2 (j) + y 2 (j) is the periodogram at frequency Xj = 2irj /T if j 
is not a multiple of T and I(cT) =0, c£Z. 

In addition we need the following assumptions on the kernel. 
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Assumption K..1. Let K(-) be a positive even function with f K(X) dX = 
1 and 



2tt 



£ * (|^) = / K (*) + °w = 1 + 



Th T 

Assumption /C.2. Let 

sup |^(A)|=0(/i T 1 ), 

AS[0,27r] 

where 

Remark 4.1. The above assumption is not as restrictive as it may seem. 
For example, it is fulfilled for bounded kernels with compact support. More 
generally it holds that (cf., e.g., Priestley [52], equations (6.2.93)~(6.2.95)), 

K h{>) = £ k (i h T) ex P(Hi A )> 
where k{x) is the inverse Fourier transform of the kernel K(-), that is, 

2^ 



k{x) = — — / K(X) exp(ix\) dX, 



respectively, 

K(X) = — / k(x) exp(— ixX) dx. 
2vr J 

From the above representation it is clear that as soon as the sum in Kh(X) 
can be approximated by an integral for Ht small enough, it holds for T large 

which yields the above assumption again for bounded K{-). Assumption fC.2 
is correct under Assumptions /C.3 or KLA given in the next section. 

We are now ready to state the corollary. 

Corollary 4.1. Assume that the respective (for each bootstrap) condi- 
tions of Theorem 4-1 are fulfilled for {V(-)}. Moreover assume that we have 
observed a sequence {¥(•)} from which we can estimate {V(-)} by {V(-)} 
such that 

(4.3) ^ {V{t )-V{t)f = o P {a- 1 ) 

t=i 



20 



C. KIRCH AND D. N. POLITIS 



for ax —> oo defined below. Furthermore we assume that for the residual- 
based and wild bootstrap we use the spectral density estimator (4-1) with a 
kernel fulfilling Assumptions JC.l and IC.2. For the local bootstrap we also 
use a kernel fulfilling Assumptions IC.l and fC.2. 

(a) If olt = T 1 / 2 [Ht is the bandwidth in (4-1)], then the assertions 
of Theorem 4-1 f or the residual-based bootstrap RB remain true, but now 
given 

(b) If ax = h^ 1 [Ht is the bandwidth in (4-1)], then the assertions of 
Theorem 4-1 for the wild bootstrap WB remain true given {¥{•)}■ 

(c) If olt = (T/hT) 1 ^ 2 (h-x is the bandwidth as in the description of the 
local bootstrap IB), then the assertions of Theorem 4-1 for the local bootstrap 
IB remain true given {Y(-)}. 

Remark 4.2. There is no assumption on {^(-)} except for (4.3). In Sec- 
tion 6 different examples are given showing the diversity of possible 
including nonstationary processes. 

Assumption (4.3) is not the weakest possible as the proof shows. However, 
it is a condition that is fulfilled in many situations and one that is easy to 
verify (cf. Section 6). Weaker conditions would typically include many sine 
and cosine terms and would therefore be much more complicated to verify. 

5. Some properties of Fourier coefficients and periodograms. In this sec- 
tion we give some examples of processes as well as kernels which fulfill the 
assumptions of the previous section. This shows that the bootstrap has the 
correct second-order moment structure for a large class of processes includ- 
ing nonlinear ones. 

For the sake of completeness we summarize some recent results of kernel 
spectral density estimation leading to Assumption ^4.1 in Lemma 5.1. In 
Lemma 5.4(b) Assumption A A is also proved under a different set of as- 
sumptions. We give now a set of assumptions on the kernel as well as on the 
underlying process given by Robinson [53], respectively, Shao and Wu [55] 
to obtain consistent kernel spectral density estimates. 

Assumption /C.3. Let the kernel K{-) be a real, even function with 

f \K(\)\dX <oo, f K(X)d\ = l. 
Jr Jr 

Furthermore the inverse Fourier transform of K 

k(x) = j K{\)ex.Y>{ix\)d\ 
Jr 

satisfies |&(x)| < k{x), where k(x) is monotonically decreasing on [0, oo) and 
chosen to be an even function with 

oo 

(1 + x)k(x) dx < oo. 
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Assumption /C.4. Let the kernel K(-) be a real, even function with 
f^K(A) d\ = 1. Furthermore the inverse Fourier transform of K 

k(x) = / K(\) exp(ixA) d\ 
Jr 

is Lipschitz continuous with support [—1, 1]. 
Assumption V. 4. Assume 

N-r 

T v (r) = N' 1 ]T - Ey(0))(y(j + r) - EV(0)) 
i=i 

satisfies uniformly in r as T — >• oo 

T v (r) -ET v (r) = Op(N^) 

for some < v < 1/2. 

A detailed discussion of this assumption can be found in Robinson [53]; 
for linear processes with existing fourth moments Assumption "P. 4 is always 
fulfilled with u = l/2. 

Assumption V.5. Assume that V(j) - EV(0) = G(. . .,s(j - l),e(j)), 
where G(-) is a measurable function and is an i.i.d. sequence. Assume 

further that 

oo 

- EV(0M0),e(-l), ...)- E(V(j) - EV(0)\e(-l),e(-2), ...)\\ 

3=0 

< OO, 

where \\X\\ = ^JeJX^ . In case of linear processes this condition is equivalent 
to the absolute summability of the coefficients. 

The next assumption is stronger: 

Assumption V.6. Assume that V(j) = G(. . . ,e(j - l),e(j)), where G(-) 
is a measurable function and is an i.i.d. sequence. Further assume the 

following geometric-moment contraction condition holds. Let {e(-)l be an 
i.i.d. copy of {£{■)}, let V(j) = G(. . . ,e(— l),e(0),e(l), . . . ,e(j)) be a coupled 
version of V(j). Assume there exist a > 0, C > and < p = p{a) < 1, such 
that for all j £ N 

E{\V{j)-V{j)\ a )<CfP. 

This condition is fulfilled for linear processes with finite variance that are 
short-range dependent. Furthermore it is fulfilled for a large class of non- 
linear processes. For a detailed discussion of this condition we refer to Shao 
and Wu [55], Section 5. 
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Assumption V. 7. Assume that V(j) = G{. .. ,e(j — l),e(j)) is a station- 
ary causal process, where G(-) is a measurable function and {e(-)} is an i-i-d- 
sequence. Let V(j) = G(. . . , e(— l),e(0),e(l), . . . ,e(j)) be a coupled version 
of V(j) where e(0) =e(0) independent of {e(j)}. Furthermore assume 



^(E(^-^) 2 ) 1/2 <oo- 

i>0 



The following lemma gives some conditions under which Assumption A.l 
holds, which is necessary for the residual-based and wild bootstrap (RB and 
WB) to be valid. Moreover it yields the validity of Assumption .4..4(h), which 
is needed for the local bootstrap LB. Lemma 5.4 also gives some assumptions 
under which the kernel spectral density estimate uniformly approximates the 
spectral density. 

Lemma 5.1. Assume that Assumptions V .1 and K..1 are fulfilled, and 
let /t(A) be as in (4-1)- 

(a) Let Assumptions KL.3 and V-4 be fulfilled; additionally the bandwidth 
needs to fulfill h? + h T T v — > 0. Then 

max |/ T (A)-/(A)| Ao. 

Ae[0,2vr] 

(b) Let Assumptions tC.4, V .2, V .3 and V .6 be fulfilled. Furthermore let 
E|y(0)|^ < oo for some 4<fi<8 and fry ->• 0, {hxT^ 1 = 0(1) for some 
0<r]<(p-4)/ii, then 

max |/ T (A)-/(A)| AO. 

Ae[0,2vr] 

Remark 5.1. For linear processes Robinson [53] gives an automatic 
bandwidth selection procedure for the above estimator; see also Politis [49]. 

The following lemma establishes the validity of Assumptions A.2. 

Lemma 5.2. Let Assumption V .1 be fulfilled. 

(a) Then Assumption A.2(i) holds. 

(b) If additionally 

(5.1) max |cov(/(i),/(fc)) - (2 7 r/(A i )) 2 ^ fc | = o(l), 

l<],k<N 

where 8j t k = 1 if j = k and else 0, then Assumption ^4.^(ii) holds. 

(c) If additionally 

(5.2) max | cov(/ 2 (j), I 2 (A;)) - 4(27r/(A,)) 4 ^ fc | =o(l), 

1<j, k<N 
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then Assumption A.2(iu) holds, and, more precisely, 

Remark 5.2. The conditions of Lemma 5.2 are fulfilled for a large class 
of processes. 

(i) Theorem 10.3.2 in Brockwell and Davis [5] shows (5.1) for linear pro- 
cesses V(t) — EV(t) = X]jl-oo w j £ {t—j)i where {e(-)} is i.i.d. with Ee(0) = 0, 
Ee(0) 4 < oo and ^ < oo. Furthermore the rate of convergence for 
j k is uniformly 0(1/T). An analogous proof also yields (5.2) under the 
existence of 8th moments, that is, if Ee(0) 8 < oo. 

(ii) Lemma A. 4 in Shao and Wu [55] shows that (5.1) [resp., (5.2)] is ful- 
filled if the 4th-order cumulants (resp., 8th-order cumulants) are summable, 
4th (resp., 8th) moments exist and Ylj>o |j7(i)| < 00 (°f- a l so Theorem 4.3.1 
in Brillinger [4]). More precisely they show that the convergence rate is 
0(1/T) in (5.1). By Remark 4.2 in Shao and Wu [55] this cumulant con- 
dition is fulfilled for processes fulfilling Assumption V . 6 for a = 4 (resp., 
a = 8). 

(iii) Furthermore, Assumption A.I is fulfilled if Assumptions V.3 and V . 7 
are fulfilled (cf. Wu [58]). 

(iv) Chiu [12] uses cumulant conditions to prove strong laws of large num- 
bers and the corresponding central limit theorems for ^ J2j=i ^C\?)^*C\j) 
for k > 1. 

The next lemma shows that weighted and unweighted empirical distri- 
bution functions of Fourier coefficients converge to a normal distribution, 
hence showing that Assumptions .4.3 and .4.5 are valid. The proof is based 
on Theorem 2.1 in Shao and Wu [55], which is somewhat stronger than the 
usual statement on asymptotic normality of finitely many Fourier coeffi- 
cients as it gives the assertion uniformly over all finite sets of fixed cardinal 
numbers; this is crucial for the proof of Lemma 5.3. 

Lemma 5.3. Let Assumptions V.l, V .3 and V .5 be fulfilled. Further- 
more consider weights {wj^N A < j < n} such that J2j=i w j,N = 1 and 

Ylf=i w2 j N ~ * as N ^ oo, then 



'3=1 W 3,N 

N 

1 • 

sup 



1 N 



2 



Ao, 



where $(•) denotes the distribution function of the standard normal distribu- 
tion. If we have weights Wj : N,s with Ylj=i w j,N,s = 1 and sup s Ylj=i w j N s ~ ^ 
0, then the assertion remains true in the sense that for any e > it holds 
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( 1 r 

supP sup - ^(1 



{x(j)<z^J(X~)} + 1 {y(j)<z^f(^)} 



Hz) 



>£ 



0. 



The next lemma shows the validity of Assumptions AA and again A.l un- 
der a different set of assumptions. For this we need to introduce yet another 
assumption on the kernel. 

Assumption /C.5. Let Kh{X) as in (4.2) fulfill the following uniform 
Lipschitz condition (Xj = 2irj/T): 



h 2 T \K h (\ s )-K h {\ t )\<L K 



Remark 5.3. In case of a uniform Lipschitz continuous kernel with 
compact support, the assertion is fulfilled for hx small enough. For infinite 
support kernels we still get Assumption /C.5 as in Remark 4.1 under certain 
stronger regularity conditions. 

Lemma 5.4. Let the process {V(-)} fulfill Assumptions V .1 and V.3. 
Furthermore the bandwidth fulfills {h?T)~ l = o(l) and the kernel K(-) ful- 
fills Assumptions fCA and K..5 in addition to l/(Thr) J2j / (Thx)) = 
OA). 

(a) Assumption A..4(i) holds, if 



(5.3) 



1 N 

2^E(Wi)i+i^')i)=op(i) J 



3=1 

sup \cav(x(l),x(k)) -7r/(A fe )5 /ifc | 

l<l,k<N 



O 



(5.4) 



sup \cov(y(l),y(k)) - 7r/(A fc )fy fc | = O — . 



KLk<N 



1 

KT 
1 



KT 



(b) Assumption A.2(ii) together with (5A), where the convergence for 
j ^ k is uniformly of rate (hT)^ 1 , implies Assumption „4.^(ii) as well as 
Assumption AA for the spectral density estimator given in (4A). 

(c) Assumption A.2(in) together with (5.2), where the convergence for 
j ^ k is uniformly of rate (hT)~ l , implies Assumption AAiiii). 



Remark 5.4. By the boundedness of the spectral density (cf. Assump- 
tion V.l) (5.4) follows, for example, from Assumption ^4.2(ii). 
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If ^2j>ij u \'f(j)\ < oo for some v > the rate of convergence in (5.4) is 

1 2! T 

T~ v for 0<i/<1, — ^— for i/ = 1, T _1 for i/ > 1. 

For a proof we refer to the supplementary material [32], proof of Lemma 5.2. 
Some conditions leading to (5.1), respectively, (5.2), with the required con- 
vergence rates can be found in Remark 5.2. 

6. Some applications. In this section we show that while our procedure 
still works for the same class of periodogram-based statistics as the classical 
frequency bootstrap methods, we are also able to apply it to statistics that 
are completely based on the time domain representation of the observations, 
such as the CUSUM statistic for the detection of a change point in the 
location model or the least-squares test statistic in unit-root testing. 

6.1. Statistics based on periodograms. The classical applications of boot- 
strap methods in the frequency domain are kernel spectral density estima- 
tors (cf. Franke and Hardle [18], Paparoditis and Politis [42]) as well as ratio 
statistics and Whittle estimators (cf. Dahlhaus and Janas [15], Paparoditis 
and Politis [42]). This includes, for example, Yule- Walker estimators for 
autoregressive processes. 

A simple calculation yields 

I*(j) = ^(x*(j)+y*(j)f, 

where is the periodogram of the TFT-bootstrap time series Z*(-) at 

Xj, and x*(-), y*(-) are defined as in Section 2. Comparing that with the 
original bootstrap procedures for the periodograms, we realize that for the 
wild bootstrap we obtain exactly the same bootstrap periodogram, whereas 
for the residual-based as well as local bootstrap we obtain a closely related 
bootstrap periodogram but not exactly the same one. The reason is that 
we did not simultaneously draw the real and imaginary part of the boot- 
strap Fourier coefficient but further exploited the information that real and 
imaginary part are asymptotically independent. Yet, the proofs showing the 
validity of the bootstrap procedure for the above mentioned applications 
go through, noting that the bootstrap's real part and imaginary part are 
(conditionally on the data) independent. The above discussion shows that 
the procedures discussed in this paper inherit the advantages as well as 
disadvantages of the classical frequency bootstrap procedures. 

6.2. Change-point tests. In change-point analysis one is interested in de- 
tecting structural changes in time-series such as, for example, a mean change 
in the following AMOC (at-most-one-change) location model: 

\n 2 + V(i), k<i<T, 
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where V(-) is a stationary process with EV(0) = 0; fix, [12 and k are un- 
known. The question is whether a mean change occurred at some unknown 
time k, the so called change-point. This shows that we are interested in 
testing 

H :k<T, A*i7^M2, H 1 :k = T. 

Typically, test statistics in this context are based on centered partial sums 
such as the well-known CUSUM statistic, 



max 

Kk<T 



3=1 



Y T ) 



Remark 6.1. For simplicity we only discuss the classical CUSUM statis- 
tic above. However, extensions to other test statistics in change-point anal- 
ysis, such as 

k 



C$\ct) 



j>2ct-l/2 



max 



i<k<T (k{T - k)) c 
T 2/3 



0<a< 



1 



T-l 



\{k{T-k)Y 



J2(Y(j)-Y T )) , 0</3<2, 



are straightforward using standard techniques of change-point analysis (cf., 
e.g., Kirch [31], proof of Corollary 6.1). This is not true for extreme-value 
type test statistics for which stronger results are needed. For a detailed 
discussion of typical test statistics we refer to Csorgo and Horvath [13, 14]. 

If {V(-)} fulfills Assumption V .1 we obtain the following limit under Hq 
(cf. also Horvath [24] and Antoch, Huskova and Praskova [1]): 

(6.1) — ^> sup \B(t)\, 

T 0<t<l 

where {B(-)} is a Brownian bridge and r 2 = 2nf(0), where /(•) is the spec- 
tral density of {V(-)}. 

Kirch [31] has already used permutation methods in the frequency domain 
to obtain approximations of critical values for change-point tests. Her idea 
was to use random permutations of the Fourier coefficients taking some 
symmetry properties into account before back-transforming them to the time 
domain using the FFT. However, the covariance structure of a time series is 
encoded in the variances of the Fourier coefficients; hence, this structure is 
destroyed by a simple permutation. 

We will now apply our TFT-bootstrap to obtain critical values for the 
above change-point tests. We do not directly observe the process V(-) since 
we do not know whether the null hypothesis or the alternative holds true; 
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thus, we estimate Z(-) = V(-) — Vt by 

z(t)=Y(t)-t 1 i {t ^ } -n 2 i {t>t} , 

where (e.g.) k = axgmax{|E, fe = i(^0') - Y T )\ : 1 < k < T}, ft = h £* =1 Y(j) 
and £2 = -^ E T = Y(j). 

T-k J=k+1 



The bootstrap statistic is then given by 



1 fc 

where {Z*(-)} is the TFT-bootstrap sequence defined in Section 2. 

The following theorem shows that the (conditional) limit of the bootstrap 
statistic is the same as that of the original statistic under Hq even if the 
alternative holds true. Hence, the distribution of the bootstrap statistic is a 
good approximation of the null distribution of the statistic, and the boot- 
strap critical values are asymptotically equivalent to the asymptotic critical 
values under both the null hypothesis as well as alternatives. This shows 
that the asymptotic test and the bootstrap test are asymptotically equiv- 
alent. In the next section a simulation study shows that frequently we get 
better results in small samples when using the TFT-bootstrap. 

Theorem 6.1. Suppose that the process V(t) fulfills the Hdjek-Renyi 
inequality (cf., e.g., Lemma B.l in Kirch [29] for linear processes), and let 

under H\ (k — k)/T = Op({3t), Pt 0. Furthermore let the assumptions in 
Theorem 4-1 hold and olt max(/3r, log T/T) — > for ay as in Corollary 4-1- 
Then it holds under Hq as well as Hi for all x S R 



C* 

P* ( max < x 

,l<fe<T T 



Y(1),...,Y(T)) Ap( sup |B(*)|<a; 



where r is as in (6.1). This shows that the corresponding bootstrap test 
(where one calculates the critical value from the bootstrap distribution) is 
asymptotically equivalent to the asymptotic test above. 

Remark 6.2. The condition (k — k)/T = Op {fix) is fulfilled for a large 
class of processes with varying convergence rates /3t; for certain linear pro- 
cesses we get the best possible rate (5t = 7 (cf., e.g., Antoch, Huskova and 
Praskova [1]), but often in the dependent situation the rates are not as good 
(cf., e.g., Kokoszka and Leipus [33]). 

It is still possible to get the above result under somewhat stronger assump- 
tions on ax, that is, on the bandwidth hr, if only weaker versions of the 
Hajek-Renyi inequality are fulfilled as, for example, given in Appendix B.l 
in Kirch [29] for fairly general processes. 
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Remark 6.3. For practical purposes it is advisable to use some type 
of studentizing here. We propose to use the adapted flat-top estimator 
with automatic bandwidth choice described in Politis [51] for the asymp- 
totic test as well as for the statistic of the original sample. Let R(k) = 

^Ej=i k z(t)z(t + k), 

{ 1, 1*1 < 1/2, 

w{t) = I 2(1 — 1/2<|*|<1, 

lo, |i|>l, 

and the bandwidth At = 2A, where A is the smallest positive integer such 
that \R(\ + k)/R(0)\ < 1.4y / log 10 T/T, for k = 1,2,3. 
Then, the estimator is given by 



A 



T 



(6.2) ? 2 = max \R(0) + 2 j£ w(k/A T )R(k), T(T _ l) £ Z(t) 2 j . 

The rightmost part in the parenthesis is chosen to ensure positivity and scale 
invariance of the estimator. 

For a discussion of a related estimator in change-point analysis we refer 
to Huskova and Kirch [26]. 

In the bootstrap domain we propose to use an estimator that is closely 
related to the bootstrap procedure, namely an estimator based on the boot- 
strap periodograms using the same kernel and bandwidth as for the boot- 
strap procedure (cf. also (10.4.7) in Brockwell and Davis [5]) 

(6.3) r 2 * =p QiT I*{l)+Y,(Pi,T+P-i,T)I*{j), 

where = x*(j) + y*(j) is the bootstrap periodogram and 

K(2irs/(Th T )) 

It can easily be seen using Assumptions B.1-B.3 that 

(6.4) e*(? 2 *-t 2 ) 2 AO, 

if ^2 j p 2 T — > which holds under very weak regularity conditions on the ker- 
nel. This shows that the studentized bootstrap procedure is asymptotically 
consistent. 

This estimator is naturally related to the bootstrap procedure and has 
proved to work best in simulations. This is similar (although maybe for 
different reasons) to the block bootstrap for which Gotze and Kiinsch [21] 
showed that, in order to obtain second-order correctness of the procedure, 
one needs to studentize the bootstrap statistic with the true conditional 
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variance of the bootstrap statistic (which is closely related to the Bartlett 
estimator), while for the original statistic one needs to use a different esti- 
mator such as the above mentioned fiat-top estimator. However, a detailed 
theoretical investigation of which type of studentizing is best suitable for 
the TFT-bootstrap is beyond the scope of this paper. 

6.3. Unit root testing. Unit root testing is a well studied but difficult 
problem. Key early references include Phillips [47] and Dickey and Fuller [16] 
(see also the books by Fuller [20] and by Hamilton [22]). Nevertheless, the 
subject is still very much under investigation (see, e.g., Cavaliere and Taylor 
[7-9], Chang and Park [11], Park [45], Paparoditis and Politis [44] and the 
references therein). 

The objective here is to test whether a given set of observations Y(0), . . . , 
Y(T) belongs to a stationary or a /(l)-time series (integrated of order one), 
which means that the time series is not stationary, but its first-order differ- 
ence Y(t) — Y(t — 1) is stationary. For simplicity we assume that Y(0) = 0, 
and we do not consider a deterministic trend component in this paper. The 
hypothesis test of interest can then be stated as 

H : {Y(.)} is 1(1), Ht : {Y(-)} is stationary. 

Now we note that for 

EY(t)Y(t-l) 

P = }^ EY(t-l)* 

the null hypothesis is equivalent to p = 1 (for a detailed discussion we refer to 
Paparoditis and Politis [43], Example 2.1). Denote V(t) = Y(t) - pY(t - 1), 
which is a stationary sequence under Hq as well as H\. While the bootstrap 
test below is valid for the general situation (Ho, Hi) above, it is intuitively 
easier to understand if one considers the following restricted situation, where 
Y(t) = pY(t — 1) + V(t) for some stationary {V(-)} with mean and tests 
Ho-p = 1 versus H\:\p\ < 1; this is the setup we use in the simulations 
below. 

An intuitive test statistic (cf. Phillips [47]) is given by Ut '■= T(j>t — 1) 
rejecting the null hypothesis if Ut < c a for some appropriate critical value 
Cq,, where 

TLiY(t)Y(t-l) 

PT = Tr 

ELn*-i) 2 

is a consistent estimator for p under both the null hypothesis as well as 
the alternative. Other choices for p and p\ are also possible (for a detailed 
discussion, see Section 2 of Paparoditis and Politis [43]). 

If {V(-)} fulfills Assumption V .1 with mean and additionally 
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then it holds under Hq that 

c W(l) 2 -a 2 /r 2 



U T := T(p T - 1) 



2jZw{t)*dt ' 

where t 2 = 27r/(0) and /(•) is the spectral density of the stationary sequence 
Y(t) — Y(t — 1), a 2 = vav(V(t)), and {W(-)} is a Wiener process (see also 
Phillips [47], Theorem 3.1). 

This shows that the limit distribution of Ut depends on the unknown 
parameters a 2 as well as r 2 if the errors are dependent. The famous Dickey- 
Fuller test is closely related to the above test statistic just using a slightly 
different normalization, but it suffers from the same problem. 

Phillips [47] and Phillips and Perron [48] suggest some modifications of the 
two tests mentioned above which do have a pivotal limit for time series errors 
as well. Later on, Perron and Ng [46], Stock [56] and Ng and Perron [40] 
propose to use the trinity of so-called M unit root statistics, which are also 
closely related to the above two tests but have pivotal limits for time series 
errors as well. Those M unit root tests are given by 

(1/T)Y(T) 2 -? 2 

2/r 2 Eli^-i) 2 

v 1/2 

MSB T = I ^-,> ]Y(t-l) 2 ) 



\ t=i 



as well as the product of the above two M statistics. As before r 2 denotes 
an estimator of r 2 . In the simulations we use the estimator as given in (6.2). 

All of the above mentioned statistics are continuous functions of the 
partial sum process {^i=i ^(^) ; < t < 1} under the null hypothesis [as 
Y(t) = X^=i y (.?')]> so that the null asymptotics are immediate consequences 
of the functional central limit theorem as given in Assumption V.2. For the 
test statistic Ut and the Dickey-Fuller test it is additionally needed that 
^^J =1 F 2 (j) — > a 2 . For example, the statistic MUt has the same asymp- 
totic limit as the statistic Ut with independent errors 

- W(l) 2 -l 
(6.5) MU T ' 



2JiW(t)*dt 

In the following we concentrate on the statistic MUt but the results for the 
other mentioned statistics follow analogously. 

We would like to apply the TFT-bootstrap to obtain critical values; that 
means we need a bootstrap sequence which is (conditionally) 1(1). In order 
to obtain this we estimate V(t) = Y(t) — pY(t— 1), which is stationary under 
both Hq as well as Hi, by 

V{t)=Y(t)-p T Y(t-l). 
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Then we can use the TFT-bootstrap based on V(-), that is, create a TFT- 
bootstrap sample V*(l), V*(2), . . . and obtain a bootstrap 1(1) sequence 
(i.e., a sequence fulfilling Hq) by letting 

y*(0)=Y(0), Y*(t) = Y*(t-l) + V*{t), t>l. 
The bootstrap analogue of the statistic MUt is then given by 
mr = (1/T)Y*(T)-t** 

T 2/T 2 J2 t = lT ( Y * it-l)) 2 ' 
where we use again the estimator r 2 * as in (6.3) for the bootstrap sequence. 

The following theorem shows that the conditional limit of the bootstrap 
statistic is the same as that appearing in the RHS of (6.5) no matter whether 
the original sequence follows the null or alternative hypothesis. This shows 
that the bootstrap critical values and thus also the bootstrap test is equiva- 
lent to the asymptotic critical values (and thus the asymptotic test), under 
both the null hypothesis as well as alternatives. 

Theorem 6.2. Suppose that the process {V(-)} has mean and ful- 
fills the assumptions in Theorem 4-1 and let (3.1) be fulfilled. Furthermore 
assume that under Hq as well as Hi it holds that 

1 T ~ X 

(6.6) {p _p T f-Y,Y 2 (t) = op{*- T 1 ) 

t=o 

for ot as in Corollary J^.l. Then it holds under Hq as well as Hi for all 
x£R that 

P*(MU^<x)-^p( W ^ 2 ~ 1 < x 



This shows that the corresponding bootstrap test (where one calculates the 
critical value from the bootstrap distribution) is asymptotically equivalent to 
the test based on (6.5). 

Remark 6.4. Condition (6.6) is fulfilled for a large class of processes 
and if ax/T— > 0, Theorem 3.1 in Phillips [47], for example, shows under 
rather general assumptions that under Hq 

1 T_1 

pr - p = Op{T~ l l — % Y 2 (t) = P (1). 

3=0 

Under Hi (6.6) also holds under fairly general assumptions (cf., e.g., Romano 
and Thombs [54], Theorem 3.1) if ax/T — >• 0; more precisely 

T-l 

p T -p = O p (T^/ 2 ), - £ Y\t) = P (1). 

j=0 
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7. Simulations. In the previous sections, the asymptotic applicability of 
the TFT-bootstrap was investigated. In this section we conduct a small 
simulation study in order to show its applicability in finite samples. 

To get a first impression of what a TFT-bootstrap sample looks like, 
we refer to Figure 1, which shows the original time series as well as one 
bootstrap sample. At first glance the covariance structure is well preserved. 

We use the statistical applications given in Section 6 to show that the 
TFT-bootstrap is indeed applicable. The usefulness of the procedure for 
statistics based on periodograms have already been shown by several authors 
(cf. Franke and Hardle [18], Dahlhaus and Janas [15] and Paparoditis and 
Politis [42]) and will not be considered again. 

However, the applicability for statistics that are completely based on time 
domain properties, such as the CUSUM statistic in change-point analysis 
or the above unit-root test statistics, is of special interest. More precisely 
we will compare the size and power of the tests with different parameters 
as well as give a comparison between the TFT, an asymptotic test, and 
alternative block resampling techniques. For change-point tests, the com- 
parison is with the block permutation test of Kirch [30]; in the unit-root 
situation we compare the TFT-bootstrap to the block bootstrap of Paparo- 
ditis and Politis [43]. For the TFT we use the local bootstrap (LB) as well 
as residual-based bootstrap (RB) with a uniform kernel (UK) as well as 
Bartlett-Priestley kernel (BPK) with different bandwidths. 

We visualize these qualities by the following plot: 

Achieved size-power curves (ASP). The line corresponding to the null 
hypothesis shows the actual achieved level on the y-axis for a nominal one 
as given by the x-axis. This can easily be done by plotting the empirical 
distribution function (EDF) of the p- values of the statistic under Hq. The 
line corresponding to the alternative shows the size-corrected power, that 
is, the power of the test belonging to a true level a test where a is given 
by the x-axis. This can easily be done by plotting the EDF of the p-values 
under the null hypothesis against the EDF of the p-values under the alter- 
native. 

In the simulations we calculate all bootstrap critical values based on 1,000 
bootstrap samples, and the ASPs are calculated on the basis of 1,000 repe- 
titions. 

Concerning the parameters for the TFT-bootstrap we have used a uni- 
form [K(t) = 2 <i>] as wen as Bartlett-Priestley kernel [K(t) = |(1 — 
t 2 )l{|f|<!}] with various bandwidths. All errors are centered exponential 
hence non-Gaussian. Furthermore AR(1) time series are used with coeffi- 
cient 7 = —0.5,0.5. Furthermore we consider GARCH(1,1) processes as an 
example of nonlinear error sequences. 
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Fig. 1. AR(1) with parameter a and corresponding TFT-bootstrap sample: residual-based 
bootstrap, Bartlett-Priestley kernel, /t = 0.01. 
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7.1. Change-point tests. We compare the power using an alternative that 
is detectable but has not power one already in order to pick up power differ- 
ences. For the AR(1) process with parameter a = —0.5, we choose d = 0.3; 
for a = 0.5 we choose d = 0.7 as changes are more difficult to detect for these 
time series. A comparison involving the uniform kernel (UK), the Bartlett- 
Priestley kernel (BPK) as well as bandwidth h = 0.01 and h = 0.03 can be 
found in Figure 2(a)— 2(d). It becomes clear that small bandwidths are best 
in terms of keeping the correct size, where the BPK works even better than 
the UK. However, this goes along with a loss in power which is especially 
severe for the BPK. Furthermore, the power loss for the BPK kernel is worse 
if combined with the local bootstrap. Generally speaking, the TFT works 
better for negatively correlated errors which is probably due to the fact that 
the correlation between Fourier coefficients is smaller in that case. 

In a second step, we compare the residual-based bootstrap (RB) with 
both kernels and bandwidth h = 0.01 with the block permutation method 
of Kirch [30] as well as the asymptotic test. The results are given in Fig- 
ure 2(e)-2(f). The TFT works best in terms of obtaining correct size, where 
the BPK beats the UK as already pointed out above. The power loss of the 
BPK is also present in comparison with the asymptotic as well as the block 
permutation methods; the power of the uniform kernel is also smaller than 
for the other method but not as severe as for the BPK. The reason probably 
is the sensitivity of the TFT with respect to the estimation of the underlying 
stationary sequence as in Corollary 3.1. In this example a mis-estimation of 
the change-point or the mean difference can result in an estimated sequence 
that largely deviates from a stationary sequence, while in the unit-root ex- 
ample below, this is not as important and in fact the power loss does not 
occur there. 

The simulation results for a GARCH(1, 1) time series with parameters 

u = 0.3, a = 0.7,13 = 0.2 [i.e., V(t) = a t e u af = oj + aeg-i + e t 
N(Q, 1)] are given in Figure 2(g) and 2(h), and it becomes clear that the 
conclusions are similar. The alternative in these plots is given by d = 0.7. 

7.2. Unit root testing. In the unit root situation we need a bootstrap 
sample of V(-) as in Corollary 3.1, where we additionally use the fact that 
{V(-)} has mean 0. In this case we additionally need a bootstrap version of 
the mean. For simplicity we use a wild bootstrap n* = IU(2vrf 2 /T) 1/2 , where 
W is standard normal distributed and r 2 is as in (6.2), where we replace 

Z(-) by V(-) — Vt- The alternative in all plots is given by p = 0.95. 

In Figure 3(a)-3(d) a comparison of different kernels and bandwidths for 
an AR(1) error sequence is given. It can be seen that again a small bandwidth 
yields best results, and in particular the BPK works better than the UK. 
Unlike in the change-point example we do not have the effect of a power loss. 



TFT-BOOTSTRAP 



35 



Null 

Alternative 



UK: h=0.01 
UK: h=0.03 
BPK: h=0.01 
BPK: h=0.03 



0.00 0.05 0.10 0.15 

(a) LB: a = 0.5 



0.20 









Null 






Alternative 












UK: h=0.01 




. ■'' ' 


--- UK:h=0.03 






BPK: h=0.01 




/ . • 


BPK: h=0.03 




/ ■ 











0.00 0.05 0.10 0.15 

(c) RB: a = 0.5 



0.20 




0.00 0.05 0.10 0.15 0.20 

(e) a = 0.5, TFT: h = 0.01 







Null 




Alternative 






UK: h=0.01 






•-- UK:h=0.03 






■ ■ BPK: h=0.01 






- - BPK: h=0.03 



0.00 0.05 0.10 0.15 

(g) RB: GARCH(1,1) 



0.20 




0.00 0.05 0.10 0.15 

(b) LB: a = -0.5 



0.20 



Null 

Alternative 



UK: h=0.01 
UK: h=0.03 
BPK: h=0.01 
BPK: h=0.03 



0.00 



0.05 0.10 0.15 

(d) RB: a = -0.5 



0.20 




0.05 0.10 0.15 

(f) a = -0.5, TFT: h = 0.01 





Null 

Alternative 




i: .■ 
• i 


asym. 




RB: UK 




RB: BPK 




block: K=5 




block: K=10 



0.00 0.05 0.10 0.15 

(h) GARCH(1,1), TFT: h = 0.01 



0.20 



Fig. 2. Change-point tests: ASP-plots for AR(1) process with parameter a with cen- 
tered exponential errors, respectively, GARCH(1, 1), d = fj.2 — ^i, T= 200, k — 100, BPK: 
Bartlett-Priestley kernel, UK: uniform kernel, LB: local bootstrap, RB: residual-based boot- 
strap. 
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Fig. 3. Unit-root tests: ASP-plots for AR(1) process with parameter a with centered ex- 
ponential errors, respectively, GARCH(1, 1), d = [i% — n\, T — 200, k — 100, BPK: Bartlet- 
t-Priestley kernel, UK: uniform kernel, LB: local bootstrap, RB: residual-based bootstrap. 
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Furthermore unlike in change-point analysis the bootstrap works better for 
positively correlated errors. 

A comparison with the asymptotic test as well as the block bootstrap by 
Paparoditis and Politis [43] can be found in Figure 3(e)— 3(f). In the case 
of a positive correlation all methods perform approximately equivalently, at 
least if we use the better working Bartlett-Priestley kernel; however for a 
negative correlation the TFT holds the level better than the other methods. 

Some results for a GARCH(1, 1) error sequence with parameters ui = 
0.3, a = 0.7, /S = 0.2 are shown in Figure 3(g) and 3(h). In this situation 
a somewhat larger bandwidth of h = 0.03 works slightly better and the TFT 
test leads to an improvement of the power of the tests. 

It is noteworthy that the appropriate bandwidth in all cases is smaller 
than what one might have expected to be a good choice. A possible ex- 
planation for this is that some undersmoothing is appropriate since the 
back-transformation will account for some additional smoothing. 

8. Conclusions. The subject of the paper is the TFT-bootstrap which is 
a general frequency domain bootstrap method that also generates bootstrap 
data in the time domain. Connections of the TFT-bootstrap with other 
methods including the surrogate data method of Theiler et al. [57], and the 
original proposals of Hurvich and Zeger [25] were thoroughly explored. 

It was shown that the TFT-bootstrap samples have asymptotically the 
same second-order moment structure as the original time series. However, 
the bootstrap pseudo-series are (asymptotically) Gaussian showing that the 
TFT-bootstrap approximates the original time series by a Gaussian process 
with the same covariance structure even when the original data sequence is 
nonlinear; see Section 3. 

Nevertheless, our simulations suggest that for small samples the TFT- 
bootstrap gives a better approximation to the critical values (as compared 
with the asymptotic ones) especially when studentizing is possible. Whether 
appropriate studentization results in higher-order correctness is a subject 
for future theoretical investigation. Choosing somewhat different types of 
bootstrapping in the frequency domain could also lead to higher-order cor- 
rectness without bootstrapping as in, for example, Dahlhaus and Janas [27] 
for the periodogram bootstrap. 

In fact the simulations suggest that a surprisingly small bandwidth (in 
comparison to spectral density estimation procedures) works best. When 
applied in a careful manner no smoothing at all still results — in theory — in 
a correct second- moment behavior in the time domain (cf. Section 2.1), sug- 
gesting that due to the smoothing obtained by the back-transformation a 
coarser approximation in the frequency domain is necessary to avoid over- 
smoothing in the time domain. 
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9. Proofs. In this section we only give a short outline of some of the 
proofs. All technical details can be obtained as electronic supplementary 
material [32]. 

We introduce the notation a n ^ b n :4$a n = 0(b n ). 

The following lemma is needed to prove Lemma 3.1. 

Lemma 9.1. Under Assumption V .1, the following representation holds: 
4 ^ 2 2 

— ^/(Afc)cOs(A fc /l) = ^2^f(h + lT) - — /(0) - — /(7r)exp(i7l7l)l {:r even}- 

k=i lez 

Proof of Lemma 3.1. By Lemma A. 4 in Kirch [31] it holds (uniformly 
in u) that 

[mu 

, m 



cos(A fc /) =o(mm(^-,'> 
i=i ^ ^ 



Thus it holds uniformly in u and v that 



N 



(9-1) E 



k=l 



[mil] 



N , v 2 

0(l)^min( T' m ) =0(mT). 

k=l ^ ' 



y COs(AfcZl) ^ cos (^kl2 
h=l h=l 

By Assumptions B.l and £>.2 and by (2.4) it holds that 

(\_mu\ \mv\ \ 

«1=1 «2 = 1 / 

L mu J L ml; J AT 

= Y E EE /(Afc) cos(A fc (/i - l 2 )) + op(m), 
; 1= i z 2 =i k=i 

where the last line follows for m/T — > as well as m = T by (9.1). Assertion 
(b) follows by an application of Lemma 9.1 as well as standard representation 
of the spectral density as sums of auto-covariances (cf., e.g., Corollary 4.3.2 
in Brockwell and Davis [5]). □ 

The next lemma gives the crucial step toward tightness of the partial sum 
process. 

Lemma 9.2. Under Assumptions V.l, B.1-B.3 it holds for u < v and 
some constant D > that 

[mv] 

+ o P (l))(v-u) 2 . 



/ imvj \* 

\ v l=[mu\ + l / 
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Proof. Equation (9.1) and Lemma 9.1 yield the assertion after some 
standard calculations. □ 

The next lemma gives the convergence of the finite-dimensional distribu- 
tion. 

Lemma 9.3. Let S* m (u) = ^E)=? J Z*(j). 

(a) If Assumptions V.l, B.1-B.3 are fulfilled and m/T — > we obtain for 
all < u±, . . . , Up < 1 in probability 

(S* m ( Ul ),...,S* m (u p ))-^N(0,E), 

where T, = (cjj)jj = i v .. jP with Cij = 2nf(0) min(ui,Uj). 

(b) If Assumptions V.l, B.l and B.4 are fulfilled we obtain for all < 
itl, . . . ,u p < 1 in probability 

(SU Ul ),...,S* T (u p ))^N(0,Z), 
where S = (ci ) j)i ) j=i ) ...j ) with Cij = 27r/(0)(min(uj, Uj) — UiUj). 

Proof. For assertion (a) we use the Cramer- Wold device and prove a 
Lyapunov-type condition. Again arguments similar to (9.1) are needed. To 
use this kind of argument it is essential that m/T — > because for m = T the 
Feller condition is not fulfilled, and thus the Lindeberg condition can also not 
be fulfilled. Therefore a different argument is needed to obtain asymptotic 
normality for m = T. We make use of the Cramer- Wold device and Lemma 3 
in Mallows [38]. As a result somewhat stronger assumptions are needed, but 
it is not clear whether they are really necessary (cf. also Remark 3.2). □ 

Proof of Theorem 3.1. Lemmas 9.2 and 9.3 ensure convergence 
of the finite-dimensional distribution as well as tightness, which imply by 
Billingsley [2], Theorem 13.5, 

[mu\ 

E*Z*(l)):0<u< 1 



{ 777* 
{W(u):0< u <l}, __►(), 
{B(u):0<u< 1}, m = T. □ 

Proof of Theorem 4.1. Concerning Assumption B.2 it holds by As- 
sumption A.l that 

^ p 
sup|var (x (k)) - irf{\ k )\ = sup|7r/(A fc ) - 7r/(A fc )| — > 0. 

k k 
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Similarly we obtain Assumption B.3 since 

sup E* (a;* (A;)) 4 = 3vr 2 sup/(A fc ) 2 < 3vr 2 sup/(A fc ) 2 + o P {\) <C + o P (l). 

k k k 



C ^ C* 

Concerning Assumption BA let X = iV(0, 1), then y ixf{\k)X = x*(k). Then 



supdl(C*(x*(k)),N(0,7Tf(X k ))) < nsnpi^fiXk)- y^)) 2 EX< 



<7rsup|/(A fc )-/(A fc )| = op(l). 

k 

The proofs especially of the last statement for the residual-based as well as 
local bootstrap are technically more complicated but similar. □ 

Proof of Corollary 4.1. We put an index V, ^respectively, V, on our 
previous notation indicating whether we use V or V in the calculation of 
it, for example, x^(j), xy(j), respectively, Uy(j), Uv{j) denote the Fourier 
coefficients based on V(-), respectively, V(-). 

We obtain the assertion by verifying that Assumptions A.l, A.2 as well as 
Assumption AA remain true. This in turn implies Assumptions B.2 as well 
as Assumption B.3. Concerning Assumption BA we show that the Mallows 
distance between the bootstrap r.v. based on V(-) and the bootstrap r.v. 
based on V(-) converges to 0. 

The key to the proof is 



Iv(j)-IvU) 



(9.2) 



where 



-^(F 2 (j) + F 2 (iV + j)) + 2-Lx v (j)F T (j) 
+ 2^=y v (j)F T (N+j), 



F T (j) ■-- 



^(y(t)-y(t))cos(tA J ; 

t=t 

T 



1<3<N, 
N <j< 2N. 



^(F(t)-F(t))sin(tA i _ JV ) 
k t=i 

This can be seen as follows: By Theorem 4.4.1 in Kirch [29], it holds that 

N 

(9.3) y^(cos(tiAj) cos^Aj) + sin(tiAj) s'mfoXj)) 



< 



N, 
1, 



h — 
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By (4.1) and an application of the Cauchy-Schwarz inequality this implies 

T 

\F T (j)\<^2\V(t)-V(t)\=o P (Ta- 1/2 ), 

2N T 
j=l t=l 

(9-4) 

= op(T 2 a^ 1 ). 
Equation (9.2) follows by 

xvti)-Xv(j)=T- 1/2 F T (j), 

yv(j)-yv(3)=T- 1/2 F T (N + j). □ 

Proof of Lemma 5.1. Some calculations show that the lemma essen- 
tially rephrases Theorem 2.1 in Robinson [53], respectively, Theorem 3.2 in 
Shao and Wu [55]. □ 

Proof of Lemma 5.2. Some careful calculations yield 
sup | cov(x(l),x(k)) - nf(Xk)Si k\ -> 0, 

l<l,k<N 

(9.6) 

sup \cov(y(l),y(k)) - n f (\ k )S l k \ ->• 0. 

l<l,k<N 

This implies by Ex(k) = Ey(fc) = and an application of the Markov in- 
equality yields 

i f Mi) m if vU) 

hence assertion (a). Similar arguments using Proposition 10.3.1 in Brockwell 
and Davis [5] yield assertions (b) and (c). □ 

Proof of Lemma 5.3. Similar arguments as in the proof of Corol- 
lary 2.2 in Shao and Wu [55] yield the result. Additionally an argument 
similar to Freedman and Lane [19] is needed. □ 

Proof of Lemma 5.4. Similar arguments as in the proof of Theo- 
rem A.l in Franke and Hardle [18] yield the result. □ 

Proof of Theorem 6.1. It is sufficient to prove the assertion of Corol- 
lary 4.1 under Hq as well as H±, then the assertion follows from Theorem 3.1 
as well as the continuous mapping theorem. By the Hajek-Renyi inequality 
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it follows under Ho that 

T 



T 

t=i 



i J>(t) - t/(t)) 2 = ±(M - Ml) 2 + ^(M - ft) 2 

-^(-ig^cno-iwi))))' 

WflogTU? i=i / 



(logT)k 

] 

(logT)(r-A:) i= |+i 



+ ^( , 1 5 £ (^(*)-E(V(t))) 



Op 



logT 



which yields the assertion of Corollary 4.1. Similarly, under alternatives 



t=i 



minffc.fc) . ^ ,o , , ^ ,o\k — k\ 
^-(jii - + \d + p j -p J \ 2 '-^ 

T — maxffc, k) , ^ n9 

+ T (M2 ~ /"2) 2 

Cp max , pr 



where d = p\ — {x-i and j = 2 if k < k and d = ^2 — pi and j = 1 otherwise, 
which yields the assertion of Corollary 4.1. □ 

PROOF of Theorem 6.2. Noting that Y*(k) = Ylj=i v *(j), the asser- 
tion follows from an application of Corollaries 4.1 and 3.1 as well as (6.4), 
since V(t) - V(t) = {p - p T )Y(t - 1). □ 

SUPPLEMENTARY MATERIAL 

Detailed proofs (DOI: 10.1214/10- AOS868SUPP; .pdf). In this supple- 
ment we give the detailed technical proofs of the previous sections. 
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